@shanedsnyder shanedsnyder commented Mar 10, 2025

Based on the original PR from @Yanlilyu (#954).

Key enhancements are as follows:

  • many new statistics and metadata fields printed
  • support for pretty printing via Rich package (default) or output in CSV format for subsequent analysis
  • support for PyDarshan name filtering, from #1017 ("ENH: PyDarshan Report changes to enable name filtering")
  • parallelization using ProcessPoolExecutor
  • optional manifest file to support massive amounts of log file input (to avoid command line limits)
  • in-depth testing to exercise various options as part of our CI
  • updated docs to describe usage of the two new tools

See docs/usage.rst for details on command line tool options.
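
For readers unfamiliar with the pattern, here is a minimal sketch of how a manifest file, ProcessPoolExecutor, and CSV output can fit together. It is not the code from this PR: the one-path-per-line manifest format, the `#` comment convention, and the function names (`read_manifest`, `summarize_log`, `summarize_many`, `rows_to_csv`) are all assumptions made for illustration, and `summarize_log` is a hypothetical stand-in that only reports file size instead of parsing a Darshan log.

```python
import csv
import io
import os
from concurrent.futures import ProcessPoolExecutor


def read_manifest(manifest_path):
    """Return the log paths listed in a manifest file (assumed: one path per line)."""
    with open(manifest_path) as f:
        # Skip blank lines and '#' comment lines (a convention assumed for this sketch).
        return [ln.strip() for ln in f
                if ln.strip() and not ln.lstrip().startswith("#")]


def summarize_log(log_path):
    """Hypothetical per-log worker; a real tool would open and parse the log here."""
    return {"log": os.path.basename(log_path),
            "size_bytes": os.path.getsize(log_path)}


def summarize_many(log_paths, max_workers=None):
    """Run summarize_log over all logs in worker processes, preserving input order."""
    with ProcessPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(summarize_log, log_paths))


def rows_to_csv(rows):
    """Render a list of dicts as CSV text with a header row."""
    if not rows:
        return ""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0].keys()),
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```

A caller would chain these as `rows_to_csv(summarize_many(read_manifest("logs.txt")))`, where `logs.txt` is a hypothetical manifest name. Reading paths from a file rather than from `argv` is what sidesteps the shell's argument-length limit, and the worker has to be a module-level function so it can be pickled and sent to the worker processes.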

@shanedsnyder shanedsnyder changed the base branch from main to snyder/pydarshan-name-filters April 27, 2025 18:06
@shanedsnyder shanedsnyder reopened this Apr 27, 2025
@shanedsnyder shanedsnyder force-pushed the pydarshan-file-based-sorting branch from 16501ab to 3d5d197 April 28, 2025 22:16
@shanedsnyder shanedsnyder reopened this Apr 28, 2025
@shanedsnyder shanedsnyder added this to the 3.4.7 milestone Apr 29, 2025
@shanedsnyder shanedsnyder force-pushed the pydarshan-file-based-sorting branch from 9ab0d49 to 05157b0 April 29, 2025 19:10
@shanedsnyder shanedsnyder changed the title [WIP] New PyDarshan CLI tools for job/file stats for many logs New PyDarshan CLI tools for job/file stats for many logs Apr 29, 2025
@shanedsnyder shanedsnyder changed the title New PyDarshan CLI tools for job/file stats for many logs ENH: new PyDarshan CLI tools for job/file stats for many logs Apr 29, 2025
@shanedsnyder
Author

This is based on snyder/pydarshan-name-filters for simplicity now, but once that is merged I'll rebase on main.

@shanedsnyder
Author

Example job_stats output:
[image: example job_stats output]

@shanedsnyder
Author

Example file_stats output:
[image: example file_stats output]

carns previously approved these changes Apr 30, 2025
@shanedsnyder shanedsnyder force-pushed the pydarshan-file-based-sorting branch from 8066582 to 99634c0 April 30, 2025 02:09
@shanedsnyder shanedsnyder changed the base branch from snyder/pydarshan-name-filters to main April 30, 2025 02:10
@shanedsnyder shanedsnyder dismissed carns’s stale review April 30, 2025 02:10

The base branch was changed.

@shanedsnyder shanedsnyder reopened this Apr 30, 2025
@shanedsnyder shanedsnyder merged commit 0ca6e9e into main Apr 30, 2025
18 checks passed